|
|
Accession Number |
TCMCG075C25098 |
gbkey |
CDS |
Protein Id |
XP_007011717.2 |
Location |
complement(join(796964..797465,797883..798154,798352..799302,799760..800051,800515..801398,801581..801793,802071..802511)) |
Gene |
LOC18587707 |
GeneID |
18587707 |
Organism |
Theobroma cacao |
|
|
Length |
1184aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007011655.2
|
Definition |
PREDICTED: DNA-directed RNA polymerases IV and V subunit 2 [Theobroma cacao] |
CDS: ATGGGGGCGTCAGCAGATGCTAAAGCAAATAGAGCTGATATCGATTTAGATGTAGATGATGATGTCTGTGATGAGGTTCTCAGTGTGCAACAGCTGGGAGAAGAGTTTTTGAGGGGTTTCTGTAAACAAGCGGCCGTGTCTTTCTTCAAAGAGTATGGCCTCATCAGTCATCAGCTCAACTCTTACAATGCTTTCATCAAATATGGTCTGCAGAACACCTTTGATTCTTTTGGAGAGTTCCTTATTCATTCTGGGTATGACCCGTCGAAGAAAGGAGAAGGTGACTGGCGTCATGCTAGAGTGAGGTTTGGGAAGGTCACTGTTGAGCGGCCAACCTTTTGGGCTGTTTCTGGAGGCAATGAGCTCAACATGCTTCCTAGGCATGCACGCCTTCAGAACATGACGTATTCCTCCAGGATGAAGGTTAATGTTGACCTTCAGGTATACACGGCAAAAAGTGTTAAGAGTGACAAGTTTAAAACTGGAAGAGAAGAGTTTGTTGAGGAGGAGGTTGTGTATCAAGATAACAGGGATATTATAATTGGGAGGATCCCTGTGATGGTGAGGTCTGACCTATGCTGGATGAATGAAGTTGAAAAAGCTGATTGTGATTTTGATCATGGAGGCTATTTTCTGATCAAGGGTACAGAAAAGATATTCATTGCACAAGAGCAAATTTCTATGAAGAGACTCTGGATTTCAAATAGCCAGGGTTGGACAATTGCTTACAGGTCGGAGGTGAAGAGAAACAGATTAATTATTAGACTAGTGGAGAATTCTAAAGTTGAATATATCAAGGGGGGAGAGAAAGTCCTCACTGTTTATTTCTTGTCAACGGAGATCCCTGTATGGGTTTTGTTTTTTGCCCTTGGTGTATCATCAGACAAAGAAGTTGTAAATCTGATTGATTATGAAAGCAATGATTCTAGTATAACAAACATACTGTTTGCCTCAATTCGTAATGCTGATGGGAAATGTTATAAATTTTGTCAAGGAAGAAATGCTATTGATTATGTAGGCAAGTTAGTCAAAGATACCAGATTTCCACCTGAAGAAGGTATTGAAGAGTGCCTTAGCACATATCTGTTTCCCACTCTGCGTAGTTTCAAGCAAAAAGCTCGCTTCCTAGGGTACATGGTAAAGTGTCTCTTGCAGGCCTACACTGGTCGCCTAAAATGTGATAATAGAGATGATTTTAGGAACAAGAGGCTAGAGTTAGCAGGTGAGCTGCTTGAGCGTGAACTGAAAGTTCACATTGCCCATGCCAGGAGGCGTATGGCTAAGACTCTGCAGAGAGATCTTTATGCAGATCGTAATGTTCGTCCTATTGAGCACTATCTTGATGCGTCTATAGTTACAAATGGACTTTCAAGAGCATTTTCTACTGGAGCCTGGTCTCATCCTTATAAAAGGATGGAAAGGATTTCAGGAGTTGTGGCAAATCTTGGACGAGCAAATCCATTGCAGACAATGGTTGATTTGAGGAAAACACGTCAACAGGTTCAGTACACTGGGAAGGTTGGAGATGCAAGATACCCACACCCTTCTCACTGGGGAAAAGTTTGCTTCCTCTCCACTCCAGATGGTGAAAATTGTGGGCTTGTAAAAAATCTGGCCACCACAGGACTTGTGAGTACAAACATAATGGAATCCATAGTTGACAAGTTGTTTGATTCCGGAATGGAGGAACTGGTTAATGATACTTGTAGCTCACTTGATGGGAAAGATAAAGTCTTTTTAAATGGGGAATGGGTTGGGGTTTGTGAAGATTCCCTTTCGTTTGCTGCTGAGGTTAGAAGAAAGCGACGCAGTAAAGAATTTCCGCATCAGGTGGAAATCAAAAGAGATGAACATAAAGGAGAAGTGCGCATCTTTTCTGATGGTGGAAGGATTCTGCGTCCTCTTTTAGTTGTTGACAACTTAAACAGAACAAAAGCATTTAAGGGGGAAAATTACACCTTCCAGGCTCTTTTAGAAGGAGGGATAATTGAGCTTGTTGGAACTGAAGAAGAAGAGGACTGTCGAACTGCATGGAGTATTAAGTATCTTTTAACAGATGTTGAGGGGAAGCAGCCTGTTAAGTATACTCATTGTGAGCTTGACATGTCATTTCTTTTGGGTTTAAGCTGTGGGATCATTCCATTTGCAAATCATGATCATGCAAGGAGAGTCCTCTATCAGGCACAGAAGCATTCTCAACAGGCCATTGGGTTTTCTACAACAAACCCCAACATTAGAGTTGATACTTTGTCACACCAATTATATTACCCCCAAAGGCCACTGTTTCGTACAATGACATCTGATTGTCTTGGAAAACTGGGACATCCTCTGGGTCAGAAGGGAGTGCTACCGAAGCCAGAATTATACAATGGCCAGAATGCTATTGTGGCGGTCAATGTTCATCTTGGATACAACCAAGAGGATTCCTTGGTAATGAACCGAGCCTCTTTAGAGCGTGGAATGTTCCGTTCTGAACACGTAAGAAGTTACAAAGCAGAAGTTGATAACAAGGAAATTCAGGATAAGAGGCGGAAGTCTGAAGATATTGTAAATTTTGGAAAAATACAAAGTAAGATTGGACGTGTGGACAGCCTTGATGATGATGGTTTTCCTTATGTTGGTGCTAACCTGCAGTGTGGTGACATTGTCATTGGGAGGTGTGCAGAGTCAGGAGCTGATCATAGTATAAAACTGAAGCACACTGAAAGAGGCATGGTTCAGAAAGTTGTGTTATCTTCCAATGATGATGGGAAAAATTATGCTGTGGTATCTCTGAGACAGGTTCGTTCTCCCTGTCTTGGAGACAAATTTTCAAGCATGCATGGGCAAAAGGGTGTTCTTGGTTTTCTGGAGTCTCAAGAGAATTTTCCTTTCACAACTCAAGGAATAGTTCCTGATATTGTAATTAACCCACATGCATTCCCTTCACGACAAACTCCAGGTCAACTCTTGGAGGCTGCTTTGGGAAAGGGGATTGCCTGTGGTGGGTCAATGAAATATGCCACCCCTTTCTCCACTATTTCTGTAGATGCCATCACAGAACAGCTTCACAGGGCTGGATTTTCAAGATGGGGAAATGAGAGAGTTTACAATGGTCGAACTGGTGAGATGGTTCGTTCACTCATATTTATGGGTCCAACATTCTACCAGCGTCTGATCCACATGGCTGAAGACAAAGTGAAATTTCGTAACACTGGACCTGTACACCCGCTTACAAGGCAGCCAGTTGCTGACCGGAAACGTTATGGTGGGATCAAATTTGGTGAGATGGAGCGTGACTGTCTCATTGCTCACGGTGCATCAGCCAACTTGCACGAGCGTCTCGTCACACTCAGTGATTCCTCCCAGATGCATGTTTGCCGCAACTGTAAAAATGTTGCAAATGTGATTGAACGGGCAGTACCAGGTGGTCGAAAGATCAGGGGTCCCTACTGCCGGGGTTGCCAGTCGGTGGATGACATTGTCAGGGTAAATGTTCCTTATGGTGCCAAGTTATTGTGCCAGGAGCTATTTAGTATGGGTATTAATCTGAAATTTGAAACCCAGCTTTGTTGA |
Protein: MGASADAKANRADIDLDVDDDVCDEVLSVQQLGEEFLRGFCKQAAVSFFKEYGLISHQLNSYNAFIKYGLQNTFDSFGEFLIHSGYDPSKKGEGDWRHARVRFGKVTVERPTFWAVSGGNELNMLPRHARLQNMTYSSRMKVNVDLQVYTAKSVKSDKFKTGREEFVEEEVVYQDNRDIIIGRIPVMVRSDLCWMNEVEKADCDFDHGGYFLIKGTEKIFIAQEQISMKRLWISNSQGWTIAYRSEVKRNRLIIRLVENSKVEYIKGGEKVLTVYFLSTEIPVWVLFFALGVSSDKEVVNLIDYESNDSSITNILFASIRNADGKCYKFCQGRNAIDYVGKLVKDTRFPPEEGIEECLSTYLFPTLRSFKQKARFLGYMVKCLLQAYTGRLKCDNRDDFRNKRLELAGELLERELKVHIAHARRRMAKTLQRDLYADRNVRPIEHYLDASIVTNGLSRAFSTGAWSHPYKRMERISGVVANLGRANPLQTMVDLRKTRQQVQYTGKVGDARYPHPSHWGKVCFLSTPDGENCGLVKNLATTGLVSTNIMESIVDKLFDSGMEELVNDTCSSLDGKDKVFLNGEWVGVCEDSLSFAAEVRRKRRSKEFPHQVEIKRDEHKGEVRIFSDGGRILRPLLVVDNLNRTKAFKGENYTFQALLEGGIIELVGTEEEEDCRTAWSIKYLLTDVEGKQPVKYTHCELDMSFLLGLSCGIIPFANHDHARRVLYQAQKHSQQAIGFSTTNPNIRVDTLSHQLYYPQRPLFRTMTSDCLGKLGHPLGQKGVLPKPELYNGQNAIVAVNVHLGYNQEDSLVMNRASLERGMFRSEHVRSYKAEVDNKEIQDKRRKSEDIVNFGKIQSKIGRVDSLDDDGFPYVGANLQCGDIVIGRCAESGADHSIKLKHTERGMVQKVVLSSNDDGKNYAVVSLRQVRSPCLGDKFSSMHGQKGVLGFLESQENFPFTTQGIVPDIVINPHAFPSRQTPGQLLEAALGKGIACGGSMKYATPFSTISVDAITEQLHRAGFSRWGNERVYNGRTGEMVRSLIFMGPTFYQRLIHMAEDKVKFRNTGPVHPLTRQPVADRKRYGGIKFGEMERDCLIAHGASANLHERLVTLSDSSQMHVCRNCKNVANVIERAVPGGRKIRGPYCRGCQSVDDIVRVNVPYGAKLLCQELFSMGINLKFETQLC |